1 GGCGTCGCCG CGCGGGGAGA AAGAAGCCGC GCCCAGCCCG GCGTCCCGAG 
51 CAGCGCAGGG GAGGATCCCC GCGCAGTGAC CCGGGAGCCA CCACAGACTC 
101 TGGGAGGCTC GGCGGCTGGA GCAGCAGGCA GCTCCCCGCA GCTCCCGGCG 
151 CTTCCAGGCA GCTCTCTGAG CCGTGCCAGA GGCCCGGCCC GCCATTCCCA 
201 GCCCCGAGCC ATGATGAAGA CTTTGTCCAG CGGGAACTGC ACGCTCAGTG 
251 TGCCCGCCAA AAACTCATAC CGCATGGTGG TGCTGGGTGC CTCTCGGGTG 
301 GGCAAGAGCT CCATCGTGTC TCGCTTCCTC AATGGCCGCT TTGAGGACCA 
351 GTACACACCC ACCATCGAGG ACTTCCACCG TAAGGTATAC AACATCCGCG 
4 01 GCGACATGTA CCAGCTCGAC ATCCTGGATA CCTCTGGCAA CCACCCCTTC 
451 CCCGCCATGC GCAGGCTGTC CATCCTCACA GGGGATGTCT TCATCCTGGT 
501 GTTCAGCCTG GATAACCGGG AGTCCTTCGA TGAGGTCAAG CGCCTTCAGA 
551 AGCAGATCCT GGAGGTCAAG TCCTGCCTGA AGAACAAGAC CAAGGAGGCG 
601 GCGGAGCTGC CCATGGTCAT CTGTGGCAAC AAGAACGACC ACGGCGAGCT 
651 GTGCCGCCAG GTGCCCACCA CCGAGGCCGA GCTGCTGGTG TCGGGCGACG 
701 AGAACTCCGC CTACTTCGAG GTGTCGGCCA AGAAGAACAC CAACGTGGAC 
751 GAGATGTTCT ACGTGCTCTT CAGCATGGCC AAGCTGCCAC ACGAGATGAG 
801 CCCCGCCCTG CATCGCAAGA TCTCCGTGCA GTACGGTGAC GCCTTCCACC 
851 CCAGGCCCTT CTGCATGCGC CGCGTCAAGG AGATGGACGC CTATGGCATG 
901 GTCTCGCCCT TCGCCCGCCG CCCCAGCGTC AACAGTGACC TCAAGTACAT 
951 CAAGGCCAAG GTCCTTCGGG AAGGCCAGGC CCGTGAGAGG GACAAGTGCA 
1001 CCATCCAGTG AGCGAGGGAT GCTGGGGCGG GGCTTGGCCA GTGCCTTCAG 
1051 GGAGGTGGCC CCAGATGCCC ACTGTGCGCA TCTCCCCACC GAGGCCCCGG 
1101 CAGCAGTCTT GTTCACAGAC CTTAGGCACC AGACTGGAGG CCCCCGGGCG 
1151 CTGGCCTCCG CACATTCGTC TGCCTTCTCA CAGCTTTCCT GAGTCCGCTT 
1201 GTCCACAGCT CCTTGGTGGT TTCATCTCCT CTGTGGGAGG ACACATCTCT 
1251 GCAGCCTCAA GAGTTAGGCA GAGACTCAAG TTACACCTTC CTCTCCTGGG 
1301 GTTGGAAGAA ATGTTGATGC CAGAGGGGTG AGGATTGCTG CGTCATATGG 
1351 AGCCTCCTGG GACAAGCCTC AGGATGAAAA GGACACAGAA GGCCAGATGA 
1401 GAAAGGTCTC CTCTCTCCTG GCATAACACC CAGCTTGGTT TGGGTGGCAG 
1451 CTGGGAGAAC TTCTCTCCCA GCCCTGCAAC TCTTACGCTC TGGTTCAGCT 
1501 GCCTCTGCAC CCCCTCCCAC CCCCAGCACA CACACAAGTT GGCCCCCAGC 
1551 TGCGCCTGAC ATTGAGCCAG TGGACTCTGT GTCTGAAGGG GGCGTGGCCA 
1601 CACCTCCTAG ACCACGCCCA CCACTTAGAC CACGCCCACC TCCTGACCGC 
1651 GTTCCTCAGC CTCCTCTCCT AGGTCCCTCC GCCCGACAGT TGTGCTTTGT 
1701 TGTGGTTGCA GCTGTTTTCG TGTCATGTAT AGTAGTAGAA ATGGAAATCA 
1751 TTGTACTGTA AAAGCCTAGT GACTCCCTCC TTGGCCAGGC CCTCACCCAG 
1801 TTCAGATCCA CGGCCTCCAC CCGGGACGCC TTCCTCCTCT GCTCCCAAAC 
1851 AGGGTTTCCG TGGCCTGTTT GCAGCTAGAC ATTGACCTCC GCCATTGAGC 
1901 TCCACGGTTT ACAGACAATT GCACAAGCGT GGGGTGGGCA GGCCAGGACT 
1951 GCTTTTTTTT AATGCTCCCA TTTCACAGAG GATACCACCG AGACTCGGAG 
2001 GGGACACGAT GAGCACCAGG CCCCACCTTT GTCCCCTAGC AAATTCAGGG 
2051 TACAGCTCCA CCTAGAACCA GGCTGCCCTC TACTGTGCTC GTTCCTCAAG 
2101 CATTTATTAA GCACCTACTG GGTGCTGGGT TCACTGTGTC CTAGGAAACC 
2151 AAGAGGGTCC CCAGTCCTGG CCTCTGCCCG CCCCTGCTGC CCCACCACCT 
2201 TCTGCACACA CAGCGGTGGG GAGGCGGGGA GGAGCAGCTG GGACCCAGAA 
2251 CTGAGCCTGG GAGGGATCCG ACAGAAAAGC TCAGGGCGGG TCTTCTCCTT 
2301 GTGCCCGGGA TTGGGCTATG CTGGGTACCA CCATGTACTC AGGCATGGTG 
2351 GGTTTTGAAC CCATAAACCA AAGGCCCTTG TCATCAGCTC TTAACAAGTA 
2401 TATTTTGTAT TTTAATCTCT CTAAACATAT TGAAGTTTTA GGGCCCTAAG 
2451 GAACCTTAGT GATCTTCTAT TGGGTCTTTC TGAGGTTCAG AGAGGGTAAG 
2501 TAACTTCCTC CAGGTCACAC AGCAAGTCTG TGGGTGGCAG AAGCAAGCTA 
2551 GCGCTGGGCA TTCAGTACAT ACCACGATGT GCTCCCTCTC TTGATGCTTG 
2601 GCCCCTGGGG CCTTCAGGGC TTTGGGACAT CTTGTCCTCA ACCCTCTCCC 
2651 TAGATCAGTC TGTGAGGGTC CCTGTAGATA TTGTGTACAC CATGCCCATG 
2701 TATATACAAG T AC AC AC AG A TGTACACACA GAT G T AC AC A TGCTCCAGCC 
2751 CCAGCTCTGC ATACCTGCAC CTGCACCCCA GCCTTGGCCC CTGCCTGCGT 
2801 CTGTGCTCAA AGCAGCAGCT CCAACCCTGC CTCTGTCCCC TTCCCCACCC 
2851 ACTGCCTGAG CCTTCTGAGC AGACCAGGTA CCTTGGCTGC ACCGGTGTGT 
2901 GGCCCGCTCT CACCCAGGCA CAGCCCCGCC ACCATGGATC TCCGTGTACA 
2951 CTATCAATAA AAGTGGGTTT GTTACAAAAA AAAAAAAAAA AAAAAAAAAA 
3001 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
3051 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 
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FEATURES : 

5'UTR: 1-210 

Start Codon: 211 

Stop Codon: 1009 

3 ? UTR: 1012 

5' UTR ANALYSIS : 

Query=cDNA clone 
Sbjct: genomic sequence 

5' UTR Exon 1, non-coding 

Score = 58.0 bits (29), Expect = 2e-10 

Identities - 29/29 (100%) 

Strand = Plus / Plus 

Query: 1 ggcgtcgccgcgcggggagaaagaagccg 29 

I I M 1 I t i M i I I I i I 1 I I I I 1 i I t I I M 
Sbjct: 1535 ggcgtcgccgcgcggggagaaagaagccg 1563 

5' UTR Exon 2, non-coding 
Score = 349 bits (176), Expect = 3e-98 
Identities - 176/176 (100%) 
Strand = Plus / Plus 

Query : 2 6 gccgcgcccagcccggcgtcccgagcagcgcaggggaggatccccgcgcagtgacccggg 85 

I M ! I t I I I I I I 1 I i i I i M I I i ! 1 I i I I 1 I I M i i I I i M I M t I I I i I I I I M i t I 1 I 
Sbjct: 2001 gccgcgcccagcccggcgtcccgagcagcgcaggggaggatccccgcgcagtgacccggg 2060 

Query: 86 agccaccacagactctgggaggctcggcggctggagcagcaggcagctccccgcagctcc 145 

f | 1 | i | M t t I I t i I I i I 1 I I 1 ! 1 I I 1 I i M 11 I I II i t 1 i ! I I! I II i I I I I I S I I I I I 
Sbjct: 2061 agccaccacagactctgggaggctcggcggctggagcagcaggcagctccccgcagctcc 2120 

Query: 14 6 cggcgcttccaggcagctctctgagccgtgccagaggcccggcccgccattcccag 201 

M M I I I i E I I I I I I M I I I I I 1 I I I I I I t I M ! I I I I I I I I i I 1 i I i I M 1 I I I I 
Sbjct: 2121 cggcgcttccaggcagctctctgagccgtgccagaggcccggcccgccattcccag 217 6 

Score = 563 bits (284), Expect = e-162 
Identities = 284/284 (100%) 
Strand = Plus / Plus 

5' UTR Exon 3, Protein coding region begins at nucleotide 211 

Query: 199 cagccccgagccatgatgaagactttgtccagcgggaactgcacgctcagtgtgcccgcc 258 

M I I I I I 1 1 1 1 1 I E ! I 1 I I I I i M M I I I I I I ! I I E i I M M I II t i I M I I I II I M I I 
Sbjct: 7474 cagccccgagccatgatgaagactttgtccagcgggaactgcacgctcagtgtgcccgcc 7533 

Homologous proteins : 

Top BLAST Hits 

CRA| 18000005194969 /altid-gi 1 10047088 /def=ref |NP_055125 . 1 1 sim. 
CRAI18000005238449 /altid-gi i 5059122 /def-gb 1 AAD38928 . 1 1 AF134 40 . 
CRA| 332000009620725 /altid=gi I 8118457 / def =gb | AAF72997 . 1 1 AF2 620 . 
CRA1 18000005232775 /altid-gi [ 7706359 /def-ref | NP_057168 . 1 1 ras-. 
CRA| 87000000006130 /altid-gi I 72307 68 / de f — gb | AAF4 30 90.11 AF2 3915. 
CRAI18000005090459 /altid-gi I 6677 673 /def=ref I NP_033052 . 1 j RAS, . 
CRA| 89000000197633 /altid=gi I 7295299 / def =gb I AAF50620 . 1 I (AE003. 
CRAI105000014645240 /altid-gi | 10503969 / def=gb | AAG17 97 9 . 1 | AF177 . 



Score 


E 


538 


e-152 


512 


e-144 


342 


2e-93 


342 


2e-93 


342 


3e-93 


341 


6e-93 


228 


8e-59 


214 


le-54 
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BLAST dbEST hits: 



gij 9345313 /dataset=dbest /taxon=960. . 
git 9335874 /dataset=dbest /taxon=960.. 
gi 110143211 /dataset=dbest /taxon=96.. 
gi I 9335309 /dataset=dbest /taxon=960 . . 
gi I 9150610 /dataset=dbest /taxon=9606. 
gi [ 10144589 /dataset-dbest /taxon=96. . 
gi|9333908 /dataset-dbest /taxon=960.. 



880 
846 
821 
813 
662 
617 
599 



0.0 
0.0 
0.0 
0.0 
0.0 



e-174 
e-169 



EXPRESSION INFORMATION FOR MODUIATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 

gi I 9345313 Placenta choriocarcinoma 

gi I 933587 4 Uterus endometrium adenocarcinoma 

gi! 10143211 Skin melanotic melanoma 

gi i 9335309 Uterus endometrium adenocarcinoma 

gi I 9150610 Skin melanotic melanoma 

gi 1 10144589 Skin melanotic melanoma 

gi I 9333908 Uterus endometrium adenocarcinoma 

Expression information from PCR-based tissue screening panels: 
Human fetal whole brain 
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1 MMKTLSSGNC TLSVPAKNSY RMWLGASRV GKSSIVSRFL NGRFEDQYTP 
51 TIEDFHRKVY NIRGDMYQLD ILDTSGNHPF PAMRRLSILT GDVFILVFSL 
101 DNRESFDEVK RLQKQILEVK SCLKNKTKEA AELPMVICGN KNDHGELCRQ 
151 VPTTEAELLV SGDEN SAYFE VSAKKNTNVD EMFYVLFSMA KLPHEMSPAL 
201 HRKISVQYGD AFHPRPFCMR RVKEMDAYGM VSPFARRPSV NSDLKYIKAK 
251 VLREGQARER DKCTIQ 

FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOS YLAT I ON 
N-glycosylation site 

Number of matches : 2 

1 9-12 NCTL 

2 125-128 NKTK 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

Number of matches: 4 

1 84-87 RRLS 

2 174-177 KKNT 

3 202-205 RKIS 

4 236-239 RRPS 

[3} PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 2 

1 19-21 SYR 

2 172-174 SAK 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 5 

1 51-54 TIED 

2 105-108 SFDE 

3 154-157 TEAE 

4 161-164 SGDE 

5 177-180 TNVD 

[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

220-228 RRVKEMDAY 

[6] PDOC00266 PS00294 PREN YLAT I ON 
Prenyl group binding site (CAAX box) 

263-266 CTIQ 

[73 PDOC00016 PS00016 RGD 
Cell attachment sequence 

63-65 RGD 

[83 PDOC00017 PS00017 ATP_GTP_A 
ATP/GTP-binding site motif A (P-loop) 

2 6-33 GASRVGKS 



FIGURE 2, page 1 of 3 



BLAST Alignment to Top Hit: 

>CRA] 18000005194969 /altid-gi [ 10047088 /def =ref I NP_055125 . 1 | similar 
to mouse Ras, dexamethasone-induced 1; tumor endothelial 
marker 2 [Homo sapiens] /org=Homo sapiens /taxon=9606 
/dataset=nraa /length=278 
Length = 278 

Score = 538 bits (1372), Expect = e-152 

Identities - 265/266 (99%) , Positives = 265/266 (99%) 

Query: 1 MMKTLSSGNCTLSVPAKNSYRMWLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVY 60 

MMKTLSSGNCTLSVPAKNSYRMVVLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVY 
Sbjct: 13 MMKTLSSGNCTLSVPAKNSYRiyiWLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVY 72 

Query: 61 NIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVK 120 

NIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVK 
Sbjct: 73 NIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVK 132 

Query: 121 SCLKNKTKEAAELPiyiVICGNKNDHGELCRQVPTTEAELLVSGDENSAYFEVSAKKNTNVD 180 

SCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDEN AYFEVSAKKNTNVD 
Sbjct: 133 SCLfCNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENCAYFEVST^KNTNVD 192 

Query: 181 EMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSV 240 

EMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSV 
Sbjct: 193 EMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSV 252 

Query: 241 NS DLKY I KAKVLREGQARERDKCT I Q 266 

N S DLKY I KAKVLREGQARERDKCT I Q 
Sbjct: 253 NSDLKYIKAKVLREGQARERDKCTIQ 278 

Complete Amino Acid Sequence of gi 110047088 / def =ref | NP__055125 . 1 i similar 
to mouse Ras, dexamethasone-induced 1; tumor endothelial 
marker 2 [Homo sapiens] /org=Homo sapiens /taxon=9606 
/dataset=nraa /length=27 8 
Length - 278 

NOTE: UNDERLINED RESIDUES NOT PRESENT IN THE PROTEIN OP THE PRESENT INVENTION 

1 mpaslallqp ram mktlssg nctlsvpakn syrmvvlgas rvgkssivsr flngrfedqy 
61 tptiedfhrk vynirgdmyq ldildtsgnh pfpamrrlsi ltgdvfilvf sldnresfde 
121 vkrlqkqile vksclknktk eaaelpmvic gnkndhgelc rqvptteael Ivsgdencay 
181 fevsakkntn vdemfyvlfs maklphemsp alhrkisvqy gdafhprpfc mrrvkemday 
241 gmvspfarrp svnsdlkyik akvlregqar erdkctiq 

>CRA| 18000005238449 /altid-gi j 5059122 /def=gb ! AAD38928 . 1 ! AF134409JL 
(AF134409) Rhes protein [Rattus norvegicus] /org=Rattus 
norvegicus /taxon=10116 /dataset=nraa /length=266 
Length = 266 

Score = 512 bits (1304), Expect = e-144 

Identities - 252/266 (94%), Positives - 256/266 (95%) 

Query: 1 MMKTLSSGNCTLSVPAKNSYRMWLGASRVGKSSIVSRFLNGRFEDQYTPTIEDFHRKVY 60 

MMKT L S SGNCT L+ VP AKN SYRMVVLGAS RVGKS S I VSRFLNGRFEDQYT PT I E DFHRKVY 
Sbjct: 1 MMKTLSSGNCTLNVPAKNSYRMWLGASRVGKSSIVSRFLNGRFEDQYTPT IE DFHRKVY 60 



Query: 61 NIRGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDNRESFDEVKRLQKQILEVK 120 

NI GDMYQLDILDT SGNHPFPAMRRLS I LTGDVFI LVFSLD+RESFDEVKRLQKQI LEVK 
Sbjct: 61 NIHGDMYQLDILDTSGNHPFPAMRRLSILTGDVFILVFSLDSRESFDEVKRLQKQILEVK 120 

Query: 121 SCLKNKTKEAAELPMVICGNKNDHGELCRQVPTTEAELLVSGDENSAYFEVSAKKNTNVD 180 

SCLKNKTKEAAELPMVICGNKNDH ELCRQVP EAELLVSGDEN AYFEVSAKKNTNV+ 
Sbjct: 121 SCLKNKTKEAAELPMVICGNKNDHSELCRQVPAMEAELLVSGDENCAYFEVSAKKNTNVN 180 

FIGURE 2, page 2 of 3 



Query: 181 EMFYVLFSMAKLPHEMSPALHRKISVQYGDAFHPRPFCMRRVKEMDAYGMVSPFARRPSV 240 

EMFYVLFSMAKLPHEMSPALH KISVQYGDAFHPRPFCMRR K AYGMVSPFARRPSV 
Sbjct: 181 EMFYVLFSMAKLPHEMSPALHHKISVQYGDAFHPRPFCMRRTKVAGAYGMVSPFARRPSV 240 



Query: 241 NSDLKYIKAKVLREGQARERDKCTIQ 266 

NSDLKYIKAKVLREGQARERDKC+IQ 
Sbjct: 241 NSDLKYIKAKVLREGQARERDKCSIQ 2 66 

Hmmer search results (Pfam) : 

Model Description 

PF00071 Ras family 

CE00060 CE00060 rab ras like 



Score E-value N 

126.2 2.8e-36 1 
20.7 0.00013 2 



Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

CE00060 T/2 22 52 . . 26 56 . . 16.9 0.0014 

CE00060 2/2 162 183 159 181 .. 3.3 7.5 

PF00071 1/1 21 186 .. 1 169 [. 126.2 2.8e-36 
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1 CTCTCTGACT CTTTGCCTCC TCTCTGACTC CCTGCCTCCT CTCTCTGTCT 
51 CCCTGCCTCC TCTGTCTGAC TCCCTGCCTC CCCTCTCTGT CTCACTGCCT 
101 CCTCTCTCTG ACTCTCTGCC TCCTCTCTCT GACTCCCTGC CTCCTCTCTC 
151 TGATTCCCTG CCTCTTTGAC CCTCTGCCTC CTCTCTTTGA CTCCCTGCCT 
201 CCTCTCTCCG ATTCTCTGCG TCTTTGACTC CCTGCCTCCT CTCTCTGACT 
251 CCCTGAAGCT CATTCAGTCA TTGCTATCAA CTCGTCTGTA CCAAGCTCTA 
301 GGCTGGAGGC TGGGCAGGGC AATGATGGAG ACAAATACTG TCCCTGGGAG 
351 CTTCTGGCCC CTTTCCCATC CTGTTTAGAC AGAAGTGACC GCCAGCAGAG 
401 TCAAGCTGTC TGCAGAAGGA CTTGGGGAGG GGGCTGTCAT GGGGTAGGGC 
451 TTCTTTCCCC CCATCTCTGC TGAAGGCCCA GGCTGGCTGA GACAGCCCCG 
501 GCAGAGACTG AGAAGGGCTC CCTGCTGTGG TCTGGCAGCC CCCTCTCCAC 
551 CCTCCTCTCT CTCATTTCCT GCCTCCCACA CGTATGCCCT GGGCACCTCA 
601 TCAGGGCTGC CCTAGGGGAG GGCCCTCCTT GGCACAGCCC CTGGGCCAGT 
651 CAGGTGGTTG AGGCTGAGGA GAGAAGGTCC CAGAGTGGGG CTTCAGGCAA 
701 ACCCAAAGAC AGAGCCCTTT GCCATTTGAT GAATGCACAG ACCCTTTATT 
751 GAGCCCCTGC TCTGTTCATG GCATGGCAGT TTTGTGGGAT AAATTCAAAG 
801 ACAGCTTTAG GTGGGAGCTG GGTGGGGGAT GTGGGGGTCT TAGGCTTGAA 
851 CTACTACCCA GCCTCCTTTG TTAACCAAGT AGCTAGTCAC GTAGCCTTCT 
901 GAGCTCGGGG CAGACCACCT GGGATCAAAC CTCTCCTCTG CTGGTTACTG 
951 GCTGTGCAAC TGTAAGCAAG TAATTTAACC TCTCTGTGCC TCAGTTTCCT 
1001 CATCTGTAAA TTGGAGAATA ACACCACCTG CTTTCTGGGG TTATGAAGGG 
1051 AGAAATAGGT TAACATGTGT GCAGCACTTA GAACACTCTG GCATATTTTA 
1101 GCTGCAAAAT GAATGCCAGC TATGATTATT TCTATACTTA GTGCGGGGCT 
1151 TGGCACACTG CATGGGCTCA AGTGGCAGCA GTTGTCGTCC TTGTGGCTCC 
1201 AGGCCTGGGG TCCGCCGTGT GCTGAGCTGG CTTATTGTGC ACGTCCCTTT 
1251 GTGATTCATT CATCGAAGTC ACATTAGTAG CTTAGAAGTG ACCGTAGTGG 
1301 GAGCATTTAC GCCATGGAAA TTGGCAATAG GGCTTTTAAC AAAGGTATTT 
1351 TTGAGAGCCG GTTTCCTGCA CAGAGGCTGG TAGTTGGGCA GGGTGAGCAG 
1401 ATCCAGATGT GTGCCAGGGA CTCGCACGCA GGCAATCTCT CCACCTCCAG 
1451 TGGCCATCTC AG AC C T TAG C TTCATGATAG CCAGGAAGCG ATGGTGTTGG 
1501 AAAGCGCCTT GGGTCAATGG GCGAGGCACT CAAGGAAACC GACTTGGGGC 
1551 ATCCTGGGGT GGGGACCGAG TTTGGGCACA TACAGCCCTT TGTGTGAATT 
1601 TAAAAACAGT GCCTTTTCCT CTACACAAGA TGCCCTTTCG TCTGGGATAC 
1651 AGCCCCCACC TCTGGGATGC AGCCCCCACT TGCCCACCCA GCCATGCGCC 
1701 TTGTGCAGTA TCCAACCTGC ACAACCTGTG GCAGCCTGTG GAAGACCGAG 
1751 GGGATTGATA TTTCAGCAGG CCTGTGCCCA TTTGCAGTTC AGGGGCTGGA 
1801 AAGCTCTCCT CTGGAGAGGG GAGGGATTCC TGCAAGGGTG AGGAGATCAG 
1851 AGAGGCCTTC AGAGAGCAGG TGGCACTTGA GCCAGACCCT GAAACATAAG 
1901 GGGAAGAGGG TGTTCTGCAG AGGGGTGGCA TGAGCAAAGG AGTGGAGGCT 
1951 GATCTCAGCA GAGCTCAAAC TGACGAGGGT GACTGGGGTC AGGGGTTCTG 
2001 GGGCGGGGAT TCTGGTGGGC GCTAAGGTAG GAAAGGAGGG AGGGCTGGGC 
2051 TGTGAAGAGC CTTTGGGGTG AGCCTGGTGG AGCCTGCGGG TTTGCTTATA 
2101 CAAGAGCTTG GATCCATGTC GGCCTCTTTC ATGAGGTCAA GAGGCTCCCA 
2151 TAGAAAGCTC TGAGTTTGCC CCAGAACCAT AACCCTTGGA GATGGGAGGG 
2201 AAGCTTGAGC CAGCCATGGG TCGTTCCCCA TTCCACATCC TCTACTCCGG 
2251 GCCTCTGGGT CTCCTGGAGG CAAGTAAACA CCTAGGGCCT GGGAGGCAAA 
2301 AATATCCGGG CAGGTCATGG AGCGGAGGGA GCCCGCCAGA TGCAGAGCAC 
2351 AGGTCTAAAG GTGGGTCCTC CTGAGGTGGC TGCAGGAGCA ACCCCAGGCA 
24 01 TTGGGCTTGG AGCATGCGGT GTGGACATAG CCTTCCCTTC TTCCCAGGAG 
2451 GGCTGAATGG CCACAGAACC ACCCCCTGCC CCAGGCTTAA GAAATGCATG 
2501 CTAGTGCCTT CCCCATGTCT TATCCTAGAA TCACAGGCTC CGGGAAAGCC 
2551 AGATGGATGA ACCAGGGAAA GAACGGATTC TCACCATAGA TACCATTTTT 
2601 GAGATTTCAC CATGTGCTGA GCCCTTTGCA ACAACTCTAT GAATTGGGCT 
2 651 CATTTTGCAG AT G AG AAAAG TGACTTCTAG AGAGGTTAAG CTACTAGCCC 
2701 AAGATCAGTA GCTAGAGGCA AGGCAAGGAT TCAAATCCCA GGAGTCCGGT 
2751 GCTTGCATAA ATGAAAGGAT GAATGAACGG ATATTGAGTG AGTGAGTGGA 
2801 TGAAGGAAGG AGTAAAGGAG AGGGCATGAA TGAATGAGAG GGTAGAACTC 
2851 CAAGACCCCT TAGAACCTCG TCTGATGTTC CCATTTTACA GACAGAAAAC 
2901 TGAGTCCTAG ACAGAGGCCT AGAGGAGGCC AAGAGGTGGT GGGGCCAGGT 
2 951 CGGGGGGGCC CTGATGCCTG CTTCTCTCGC TTTGTTGCAG CCCCGAGCCA 
3001 TGATGAAGAC TTTGTCCAGC GGGAACTGCA CGCTCAGTGT GCCCGCCAAA 
3051 AACTCATACC GCATGGTGGT GCTGGGTGCC TCTCGGGTGG GCAAGAGCTC 
3101 CATCGTGTCT CGCTTCCTCA ATGGCCGCTT T GAG G AC C AG TACACACCCA 
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3151 CCATCGAGGA CTTCCACCGT AAGGTATACA ACATCCGCGG CGACATGTAC 
3201 CAGCTCGACA TCCTGGATAC CTCTGGCAAC CACCCCTTCC CCGCCATGCG 
3251 CAGGCTGTCC ATCCTCACAG GTGAGGCCCA CTGGTGCCTG GGCTGGGGCG 
3301 GCAGGGCCAG GGCATGGGTG CGGAGTGTGC TGGGCACTTG GCAGTTTGCA 
3351 TAGACTTGCA TAGCCATCGT CTGAGACAGG CGTCATCCCT GCACAATGAG 
34 01 GCTCAGAGAG GTTTTGCCAT GTGCTGGAAA TAGTGATGAA GTCGGGGGCC 
3451 CCGATTCCAT TCTGTTAGAC TCCAGATCGA TTACTCATGG CTGTCGGGGC 
3501 CGCCTTCCAG ATCAGGAGCT GATACCAGCA TGCCCCAGGG ATATTCCTTT 
3551 CTAGGGAACA GAATGATGCC CTGGCTGCTG CTTTCCTTCT CCGGAAGATG 
3601 ACCCACCAGA GCTCCAGGGC CCAAGGTCAG TCCACGGGGC TCAGGTCTCC 
3651 CACACCCCAG GCCTTTGCCA CCTCCTAGAG AGGTAAGGGC AGGACCCAGG 
3701 CAGTGATCAC CAAAGGGAAG GGGGCTTGGT CATGGTCATA GTGATGGTGA 
3751 TGGCACTAGC TGACACTTAT CAGAAGCTAT GGGCCTGGCC CTGTTCTTAG 
3801 AGCTTGGCAT GTAGTTTTTT TTGAAACAGA GTCTCGCTCT GTCACCCAGG 
3851 CTGGAGTGCA GTGGCGCGAT CTCGGCTCAC TGCAACCTCT GCCTCCCGGG 
3901 TTCAAGCGAT TCTCCTGCCT CAGTCCCCCA AGTAGCTGGG ACTACAGGCA 
3951 CGTGCCACCA TGCCCGGCTA ATTTTTTGTA TTTTTACTAG AGACGGGATT 
4 001 TCACCATGTT AGCCAGGATG GTCTCGATCT CCTGACCTCG TGATCTGCCT 
4 051 ACCTCAGCCT CCCAAAGTGC TGAGATTACA GGCGTCAGCC ACCGCGGCCA 
4101 GCCAGCATGT AGTTATTTAA CCCTCACAGT AAATAGTTAT TCATTCCCTT 
4151 TTTACAGGTG GGGAAACTGA AGCCCAGAGA GGTTAAGTAA CTCACTCCAG 
4201 TGGTAGCACA GCTCGTAAAG GCAGTCTGCT TTTGTGCTTT CAGACAAAGC 
4251 C AT AC C AC AG CCTCTCAGCC CTGCTGGGAA GGGTGAGGAG GGACAGGGAG 
4301 GTTGGGGGGA AGAAGGGGTG AGTGGAGCTG AGGGGCTGTG CCCTTGTTTA 
4351 CACTGCATTA GCATGGTAGC TAAGAGGACA AGCCCGGGAC CCAGCACCTG 
4 401 GGTGTGAGCC CTGGTTCCGC TGCTTCCTGG CTTTGTACCT CGAGGCAAGG 
4451 GATTTTATCT CCTTGTGTCT CAGCACTCTC ATCTGTAAGA CTGCACCACA 
4501 TCAACACTCA TCCTAAAGGG ACT GT GAG AC TTAAATGAAT GAATATATGT 
4 551 AAGGCGCTTG GTGAGCAGAT AGTAAATGCA CAATAAATCC CCAAGTCTTC 
4 601 TGTTGAGTCA GCATTTGCAA GTGGGCCTGC TACGGGTTAC AC GAT CAT TT 
4 651 CCCAAGTCAC GCCCCTGAAG TTGCTGAGCA GGGATAAGGG AAGGAGTGAG 
4701 CAGGCAACTC TCTAGGCATC ATTCAGATAA CCCCCCAACT GAGGTACTTC 
4751 TATACAGAGA AACCCATGCC ACTCCCAGCC CTGCTGCCGC CTTGAGGCCA 
4 801 AGACTGAGGC TGCGGGGTGG CCCCTCCTTG AGTGCTTTCT CTTCCAGGCT 
4 851 GGCTTTTCCG AGCATCTGAC CCAGACAGCA GTCAAGTTCT CCGCTCCACC 
4 901 CCGAGTTTTG GAGAAGGGGC ATGTAGATAG GAGAGCCCTG GGTCGACCTG 
4 951 TGTTCGAATC CTTGCTGGGC CTCTTGGCTT AAATGTGTGA CCAGAGGCAC 
5001 ATGCATCCTG TCTGAGTCTC AGCCTCCCCA GCCACACAGT GGGCTTAACC 
5051 TCATACCCCG CAGGGAGGCT GTGAGGACTG CAAGAAGGCT TGTGGCGGGA 
5101 GCTTCCAGCA CGTGACGGGT ATTGCATTGG TGTCAGCTCC CCCAGCCTTG 
5151 GGGAGGGGAC TGGGTACCCG CTGCAATGAA TAAGGCTAAT GACAGAGGGA 
5201 AGGAGAGGGG AGATGTAGAG AGGAAGCACA TGCATATTTT CAGCATTAAT 
5251 TTTCAGTGAC ACAAGTAATA CCCAAACACA CCCTCCTGCA AACGCTACAG 
5301 ATAAAGCTAA TGCCCCTTTG ACCCATGTCC CCAATCCCAG GCTCCTGCCC 
5351 CTGCCCCGGA GGTGGCCACC CTGGCAGTCT GGCATGGAGC CTTCCGGGCC 
54 01 TCCGTGACTA CACCGGCATT CGTATTTGTA TCCCCACAAT GGAGAGTATT 
54 51 TTTGTCTGTC TCTTTTTTAT GGCGCATATC ATTCTGAGCA CAGCTGTCTG 
5501 ATGCTTGTTT TTTTTTCACA CACCAACCCG TGCCTCATTT TCCAACCTGG 
5551 TGGAACCTCA TTTTTTCAAC CTCATTTTCC TGCTGCTCAG GAAATTCTGA 
5601 AAGCCATTAA TTCCACTGCC AGCTCTTCTT CCAGCTGCCA GACGGGCCGA 
5651 TCTCTGATGC TTGGCATCGC AGTCTCGCAT TTGAATATGT CAAGGCCACG 
57 01 AGTCCTCAGG GGCCCCGGAT TAGCCTAACG GGATGGGGTT TGGCAGCCCA 
5751 AGCAGGAAGA GTTGCCAAGC TGACGCTGCC TCGCAAGTGC CTTTCAGAAG 
5801 AGCCCACACT GCAGTTCCCT CCATCACCTC CCATCCATTC AGGCTTCCTT 
5851 GGTTAACACT GACTGTGTGC CAGGCCCTGG GGAGACCAGG ACGAGTGGGT 
5901 GATGGAACCC TTCTCTGTGC CCGAGCTGTT TGGAGCACAC CTTTGATCTG 
5951 GACACCATTC TGAATGTGCC ATGTGCCATT AAATGGGGGT AAATGATGTG 
6001 CTCTGGGGGT GCAGAGGAAG GTGGCAGCCA TTCTGCCAGA AGCTGGAACT 
6051 GGTTGCTTCT CTTCTCAAGA ATTTGGGCCA ATTGCTGATT CCTCTGGGCC 
6101 TCAGTTTCCT CATCTGTGAG ACAGGGATCT TGTCACACCA CAAGGCTATC 
6151 AAGAGTTTGA GCAAAAGTGG TTGGACGCAG TGGCTCATGC CTGTAATCCC 
6201 AGCTCTTTGG GAGGCCGAGG TGGGCAGATC TCTTGAGGTC AGGAGTTCAA 
6251 GACCAGCCTG GCTAACACAG TGAAACACCG TCTCTACTAA AAAATACAAA 
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6301 AAATTAGCCA GGTGTGGTGA TGGGCACCTG TAATCCCAGT TACTCGGGAG 
6351 GCTGAGGCAG GAGAATCTCT TGAACCCAGG AGGTGGAGGT TGCAGTGAGC 
6401 TGAGATCTTG CCATTGCATT CCAGGCTGGG CAACAAGAGT GAAACTCTGT 
6451 CTCAGAAAAT AAATAAATAA ATAAATAAAA AATAGCTAGG CATGGTGACA 
6501 GGCGCCTGTA ATCCCAGCTG CTCCAGAGGC TGAGGCAGGA GAATCGCTTA 
6551 AACCCAGGAG GTGGAAGTTG CAGTGAGCCA AG AT C AC AC C ACTGCACTCC 
6601 AGCCTGGGCC ACAGAGAAAG ACTCCATCTC AAAAAAAAAA AAAAAAAAAA 
6651 AAAAAGTTTA AGCAAAAGTG AGGAAGGTGC TTATTAAAAG CTGGAAATCA 
6701 GGATGGAGGT ACCAGTCCAG ACAGCCTCCC CACCACCCCA CCGTCTCCAC 
6751 AGCAGCCCCT GTTTCAGATT CACAAGCCTG CCTTGAGTGA TGCAGTGAGT 
6801 TATCCTGGAG GCAGTGTGGG CCTTGGAGGC CAGCACTCAC TTTTTCATCC 
6851 TATGATTTAT TTGAGAAGCA GAGAGCACCT ACCGGGTGCC AGGAACGAGC 
6901 TAGGTGAGAA CAGAATCAGG TAGAAATCTC AGCCTAGCCA CACGGAAGCT 
6951 GTGTGATCTT GGGCAGGCTG CATACCCTTT CTGAGCCTCA GTTTGCTCAC 
7 001 CTGTAATGCA AAGGTAACAA AATCTTGACA GAGGCATAGT GAGGAATCAA 
7 051 GAGAACAACG GGCCTGGAGC ATACACCCAG TGCTTAGCCC CCAGTAGGCC 
7101 CTCACTCTCA TCATTACTGA CACCTGAGGT CACTGAGCAT GTGCCACTGT 
7151 CCATTCATTA TCTTGCATAA CTCCCAAAAT CATCCTGCAA GGTAATATTT 
7201 CATCTTCATG AAACAGACAG AGAAACTGAG GTTACAGAGG TTTCGTGATC 
7251 TGCCCAAGTC TGCTGGCAGC TAAGCGGATG AGGCCAGATG CAAACTAGGC 
7301 ATTGAGCAAG ACAGGCAGGA CCCCTGCTCT CATAGAAATG ATTTTTATTA 
7351 TTATCTGAAC ACAGTCCACA CAAGTGACCT ACCCCTCTCC AGCCCTGCAA 
7 401 AGAAATGTGA AGTGAGTTAA CTGTATTTGA ACCAAGTGGT CCACGTGTTA 
7 451 GCTATGCGAC TGTGAACAGG GGCTTCAACC CCCTCAGCCT CAGTTTCCTG 
7501 TCCTGGAAAA TAATCGCAGG GAGAATAATC GCAGCTACCC CGAAGAGTCG 
7 551 CTGTGTAGGT TAAAGCAGTT ATGCCGCATA ACTGCTTCAG GGCACCTGTG 
7 601 ACTCCCAGCT CTTAGGGCTG ATGTTCTGTG GCCAGAGGAG GGCAGGGGTT 
7 651 GCAGCTGGCC GGTGAACTCA CTACCTGGGC TCTCTCCCTG CAGGGGATGT 
7701 CTTCATCCTG GTGTTCAGCC TGGATAACCG GGAGTCCTTC GATGAGGTCA 
7751 AGCGCCTTCA GAAGCAGATC CTGGAGGTCA AGTCCTGCCT GAAGAACAAG 
7801 ACCAAGGAGG CGGCGGAGCT GCCCATGGTC ATCTGTGGCA ACAAGAACGA 
7 851 CCACGGCGAG CTGTGCCGCC AGGTGCCCAC CACCGAGGCC GAGCTGCTGG 
7 901 TGTCGGGCGA CGAGAACTGC GCCTACTTCG AGGTGTCGGC CAAGAAGAAC 

7 951 ACCAACGTGG ACGAGATGTT CTACGTGCTC TTCAGCATGG CCAAGCTGCC 
8001 ACACGAGATG AGCCCCGCCC TGCATCGCAA GATCTCCGTG CAGTACGGTG 
8051 ACGCCTTCCA CCCCAGGCCC TTCTGCATGC GCCGCGTCAA GGAGATGGAC 
8101 GCCTATGGCA TGGTCTCGCC CTTCGCCCGC CGCCCCAGCG TCAACAGTGA 
8151 CCTCAAGTAC ATCAAGGCCA AGGTCCTTCG GGAAGGCCAG GCCCGTGAGA 
8201 GGGACAAGTG CACCATCCAG TGAGCGAGGG ATGCTGGGGC GGGGCTTGGC 
8251 CAGTGCCTTC AGGGAGGTGG CCCCAGATGC CCACTGTGCG CATCTCCCCA 
8301 CCGAGGCCCC GGCAGCAGTC TTGTTCACAG ACCTTAGGCA CCAGACTGGA 
8351 GGCCCCCGGG CGCTGGCCTC CGCACATTCG TCTGCCTTCT CACAGCTTTC 
84 01 CTGAGTCCGC TTGTCCACAG CTCCTTGGTG GTTTCATCTC CTCTGTGGGA 
8451 GGACACATCT CTGCAGCCTC AAGAGTTAGG CAGAGACTCA AGTTACACCT 
8501 TCCTCTCCTG GGGTTGGAAG AAATGTTGAT GCCAGAGGGG TGAGGATTGC 
8551 TGCGTCATAT GGAGCCTCCT GGGACAAGCC TCAGGATGAA AAGGACACAG 

8 601 AAGGCCAGAT GAGAAAGGTC TCCTCTCTCC TGGCATAACA CCCAGCTTGG 
8 651 TTTGGGTGGC AGCTGGGAGA ACTTCTCTCC CAGCCCTGCA ACTCTTACGC 
87 01 TCTGGTTCAG CTGCCTCTGC ACCCCCTCCC ACCCCCAGCA CACACACAAG 
87 51 TTGGCCCCCA GCTGCGCCTG ACATTGAGCC AGTGGACTCT GTGTCTGAAG 
8801 GGGGCGTGGC CACACCTCCT AGACCACGCC CACCACTTAG ACCACGCCCA 
8851 CCTCCTGACC GCGTTCCTCA GCCTCCTCTC CTAGGTCCCT CCGCCCGACA 
8901 GTTGTGCTTT GTTGTGGTTG CAGCTGTTTT CGTGTCATGT ATAGTAGTAG 
8 951 AAATGGAAAT CATTGTACTG TAAAAGCCTA GTGACTCCCT CCTTGGCCAG 
9001 GCCCTCACCC AGTTCAGATC CACGGCCTCC ACCCGGGACG CCTTCCTCCT 
9051 CTGCTCCCAA ACAGGGTTTC CGTGGCCTGT TTGCAGCTAG ACATTGACCT 
9101 CCGCCATTGA GCTCCACGGT TTACAGACAA TTGCACAAGC GTGGGGTGGG 
9151 CAGGCCAGGA CTGCTTTTTT TTAATGCTCC CATTTCACAG AGGATACCAC 
9201 CGAGACTCGG AGGGGACACG ATGAGCACCA GGCCCCACCT TTGTCCCCTA 
9251 GCAAATTCAG GGTACAGCTC CACCTAGAAC CAGGCTGCCC TCTACTGTGC 
9301 TCGTTCCTCA AGCATTTATT AAGCACCTAC TGGGTGCTGG GTTCACTGTG 
9351 TCCTAGGAAA CCAAGAGGGT CCCCAGTCCT GGCCTCTGCC CGCCCCTGCT 
9401 GCCCCACCAC CTTCTGCACA CACAGCGGTG GGGAGGCGGG GAGGAGCAGC 
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9451 
9501 
9551 
9601 
9651 
9701 
9751 
9801 
9851 
9901 
9951 
10001 
10051 
10101 
10151 
10201 
10251 
10301 
10351 
10401 
10451 
10501 
10551 
10601 
10651 
10701 
10751 
10801 
10851 
10901 
10951 
11001 
11051 
11101 
11151 
11201 



TGGGACCCAG 
GGTCTTCTCC 
TCAGGCATGG 
TCTTAACAAG 
TAGGGCCCTA 
AGAGAGGGTA 
AGAAGCAAGC 
TCTTGATGCT 
CAACCCTCTC 
ACCATGCCCA 
CATGCTCCAG 
CCCTGCCTGC 
CCTTCCCCAC 
GCACCGGTGT 
TCTCCGTGTA 
TGCCCATGTG 
ACCAAAGCTG 
CTGAGGGTCA 
GATAAGTCCC 
GGGTCCTCCG 
CCCCGTGGAC 
ATTCCTGTAC 
GAAACCTTTT 
GGAGGCCAGG 
ACCCCCACCC 
CCTTACTTAG 
ATGGATTTGT 
TGTGAAGTGA 
GAAGGTCTGG 
CTTCCCTGGG 
GTGTGTTTTA 
GAAGCCCACT 
ATATCTTGTC 
ACACCAGAGG 
GGGAGGTGGA 
TCAGGGAGCC 



AACTGAGCCT 

TTGTGCCCGG 

TGGGTTTTGA 

TATATTTTGT 

AGGAACCTTA 

AGTAACTTCC 

TAGCGCTGGG 

TGGCCCCTGG 

CCTAGATCAG 

TGTATATACA 

CCCCAGCTCT 

GTCTGTGCTC 

CCACTGCCTG 

GTGGCCCGCT 

CACTATCAAT 

TATTTTTTGT 

GCCTTTCCCT 

GCACTGAGTC 

CCAGATGGAA 

GGTGTGTGTA 

CATTAGCTCC 

AGCAGCAATT 

CCAAGCCTGG 

GACACCATTC 

CACTGAGAAT 

ACTCACCCCA 

TGGACCACAT 

GACAGGATGT 

GCCCTGACCA 

CAGAGGGAAA 

CCCAGCGAGT 

CATGTTCACC 

CAAGATGGCA 

CTGTCTGATT 

AACCATCTGT 

AG G C CAT AAA 



GGGAGGGATC 
GATTGGGCTA 
ACCCATAAAC 
ATTTTAATCT 
GTGATCTTCT 
TCCAGGTCAC 
CATTCAGTAC 
GGCCTTCAGG 
TCTGTGAGGG 
AGTACACACA 
GCATACCTGC 
AAAGCAGCAG 
AGCCTTCTGA 
CTCACCCAGG 
AAAAGTGGGT 
ATTTCCAAGA 
CCCAAAATGC 
CACCTTCAAG 
GGTGATGCCC 
CCGAGGTGTC 
AGGAGGATCT 
TCACCCGCAG 
GGCTGGGGCT 
AGTATCCTCC 
TATCTGGCCT 
GGGGCTGGGA 
TCTGGACGGA 
GGGTGAGGAT 
ACACGGAATG 
GGAGGAAGTC 
CTCACGCCAT 
CCATCTGAGC 
CAGCTGGTGA 
TCCGTCTGGC 
GCTTTTGATC 
G 



CGACAGAAAA 

TGCTGGGTAC 

CAAAGGCCCT 

CTCTAAACAT 

ATTGGGTCTT 

ACAGCAAGTC 

ATACCACGAT 

GCTTTGGGAC 

TCCCTGTAGA 

GATGTACACA 

ACCTGCACCC 

CTCCAACCCT 

GCAGACCAGG 

CACAGCCCCG 

TTGTTACAAA 

GGAGGTGTGC 

ACCTGCCGTG 

TGTAAGTGTG 

TCCTTCAGCC 

TGTGTCCACA 

CCGTGTCTGA 

GGGACAGTTG 

GCTACTCTCA 

AACGCACAGG 

CAAATGCCAA 

CACGCCCCCA 

ACCCACAGCA 

GGAAAGT GG A 

TGCCCCCTGG 

AGTGAGGTAA 

CCTATCACCC 

ATTTAGGCTC 

AGTGGCAGAT 

TGAAGAAAGA 

AGCAAATGCC 



GCTCAGGGCG 
CACCATGTAC 
TGTCATCAGC 
ATTGAAGTTT 
TCTGAGGTTC 
TGTGGGTGGC 
GTGCTCCCTC 
ATCTTGTCCT 
TATTGTGTAC 
CAGATGTACA 
CAGCCTTGGC 
GCCTCTGTCC 
TACCTTGGCT 
CCACCATGGA 
GCCGTGTCCT 
CCCTTTCCAG 
TACCCTGGCC 
GGGAGAGGGG 
TGGCCCTCCT 
AAGAAGGGGC 
GTTCTTTGTG 
GCAATCTCTG 
TCTGGTGGGT 
ATGCCCCTCC 
GCGTGGGCAG 
CCTGCGTGTG 
TAAGC ACT CC 
GGCTGAGGGA 
GACTGAGAGG 
AATACTCCCT 
AGCCCCGAGG 
AGAGAGCTCA 
CAGAGATTCA 
TTTTGCATCA 
ACCAGCAGGA 



FEATURES: 

Start: 
Exon: 
Intron: 
Exon : 
Stop: 



3000 
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7694-8220 
8221 



CHROMOSOME MAP POSITION: 
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G 
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4513 


C 


T 


Intron 




6894 


C 


A 


Intron 




8409 


G 


C 


Beyond 


0RF(3 T ) 


8437 


T 


G 


Beyond 


ORF(3 r ) 


8579 


T 


C 


Beyond 


ORF(3') 


10292 


A 


G 


Beyond 


ORF(3' ) 


10792 


A 


G 


Beyond 


ORF(3' ) 



Context : 
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CACACCCCAGGCCTTTGCCACCTCCTAGAGAGGTAAGGGCAGGACCCAGGCAGTGATCAC 
CAAAGGGAAGGGGGCTTGGTCATGGTCATAGTGATGGTGATGGCACTAGCTGACACTTAT 
CAGAAGCTATGGGCCTGGCCCTGTTCTTAGAGCTTGGCATGTAGTTTTTTTTGAAACAGA 
GTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAACCTCT 
GCCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGTCCCCCAAGTAGCTGGGACTACAGGCA 
[C,T] 

GTGCCACCATGCCCGGCTAATTTTTTGTATTTTTACTAGAGACGGGATTTCACCATGTTA 
GCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTACCTCAGCCTCCCAAAGTGCT 
GAGATTACAGGCGTCAGCCACCGCGGCCAGCCAGCATGTAGTTATTTAACCCTCACAGTA 
AATAGTTATTCATTCCCTTTTTACAGGTGGGGAAACTGAAGCCCAGAGAGGTTAAGTAAC 
TCACTCCAGTGGTAGCACAGCTCGTAAAGGCAGTCTGCTTTTGTGCTTTCAGACAAAGCC 

CAGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAAC 
CT CTGCCT CC CGGGTT CAAGCGATT CT CCTGCCT CAGT CCCCCAAGT AGCTGGGACT ACA 
GGCACGTGCCACCATGCCCGGCTAATTTTTTGTATTTTTACTAGAGACGGGATTTCACCA 
TGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGAT CTGCCT ACCTCAGCCTCCCAAA 
GTGCTGAGATTACAGGCGTCAGCCACCGCGGCCAGCCAGCATGTAGTTATTTAACCCTCA 
[C,T] 

AGTAAATAGTTATTCATTCCCTTTTTACAGGTGGGGAAACTGAAGCCCAGAGAGGTTAAG 
TAACTCACTCCAGTGGTAGCACAGCTCGTAAAGGCAGTCTGCTTTTGTGCTTTCAGACAA 
AGCCATACCACAGCCTCTCAGCCCTGCTGGGAAGGGTGAGGAGGGACAGGGAGGTTGGGG 
GGAAGAAGGGGTGAGTGGAGCTGAGGGGCTGTGCCCTTGTTTACACTGCATTAGCATGGT 
AGCTAAGAGGACAAGCCCGGGACCCAGCACCTGGGTGTGAGCCCTGGTTCCGCTGCTTCC 

TGCAGTGGCGCGATCTCGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCGATTCTCCT 
GCCTCAGTCCCCCAAGTAGCTGGGACTACAGGCACGTGCCACCATGCCCGGCTAATTTTT 
TGTATTTTTACTAGAGACGGGATTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGAC 
CTCGTGATCTGCCTACCTCAGCCTCCCAAAGTGCTGAGATTACAGGCGTCAGCCACCGCG 
GCCAGCCAGCATGTAGTTATTTAACCCTCACAGTAAATAGTTATTCATTCCCTTTTTACA 
[G,A] 

GTGGGGAAACTGAAGCCCAGAGAGGTTAAGTAACTCACTCCAGTGGTAGCACAGCTCGTA 
AAGGCAGTCTGCTTTTGTGCTTTCAGACAAAGCCATACCACAGCCTCTCAGCCCTGCTGG 
GAAGGGTGAGGAGGGACAGGGAGGTTGGGGGGAAGAAGGGGTGAGTGGAGCTGAGGGGCT 
GTGCCCTTGTTTACACTGCATTAGCATGGTAGCTAAGAGGACAAGCCCGGGACCCAGCAC 
CTGGGTGTGAGCCCTGGTTCCGCTGCTTCCTGGCTTTGTACCTCGAGGCAAGGGATTTTA 

TCGTAAAGGCAGTCTGCTTTTGTGCTTTCAGACAAAGCCATACCACAGCCTCTCAGCCCT 
GCTGGGAAGGGTGAGGAGGGACAGGGAGGTTGGGGGGAAGAAGGGGTGAGTGGAGCTGAG 
GGGCTGTGCCCTTGTTTACACTGCATTAGCATGGTAGCTAAGAGGACAAGCCCGGGACCC 
AGCACCTGGGTGTGAGCCCTGGTTCCGCTGCTTCCTGGCTTTGTACCTCGAGGCAAGGGA 
TTTTATCTCCTTGTGTCTCAGCACTCTCATCTGTAAGACTGCACCACATCAACACTCATC 
[C,T] 

TAAAGGGACTGTGAGACTTAAATGAATGAATATATGTAAGGCGCTTGGTGAGCAGATAGT 
AAATGCACAATAAATCCCCAAGTCTTCTGTTGAGTCAGCATTTGCAAGTGGGCCTGCTAC 
GGGTTACACGATCATTTCCCAAGTCACGCCCCTGAAGTTGCTGAGCAGGGATAAGGGAAG 
GAGTGAGCAGGCAACTCTCTAGGCATCATTCAGATAACCCCCCAACTGAGGTACTTCTAT 
ACAGAGAAACCCATGCCACTCCCAGCCCTGCTGCCGCCTTGAGGCCAAGACTGAGGCTGC 

GCACTCCAGCCTGGGCCACAGAGAAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAGTTTAAGCAAAAGTGAGGAAGGTGCTTATTAAAAGCTGGAAATCAGGATGGAGGTACC 
AGTCCAGACAGCCTCCCCACCACCCCACCGTCTCCACAGCAGCCCCTGTTTCAGATTCAC 
AAGCCTGCCTTGAGTGATGCAGTGAGTTATCCTGGAGGCAGTGTGGGCCTTGGAGGCCAG 
CACTCACTTTTTCATCCTATGATTTATTTGAGAAGCAGAGAGCACCTACCGGGTGCCAGG 
[C,A] 

ACGAGCTAGGTGAGAACAGAATCAGGTAGAAATCTCAGCCTAGCCACACGGAAGCTGTGT 
GATCTTGGGCAGGCTGCATACCCTTTCTGAGCCTCAGTTTGCTCACCTGTAATGCAAAGG 
TAACAAAATCTTGACAGAGGCATAGTGAGGAATCAAGAGAACAACGGGCCTGGAGCATAC 
ACCCAGTGCTTAGCCCCCAGTAGGCCCTCACTCTCATCATTACTGACACCTGAGGTCACT 
GAGCATGTGCCACTGTCCATTCATTATCTTGCATAACTCCCAAAATCATCCTGCAAGGTA 
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84 09 CATGGTCTCGCCCTTCGCCCGCCGCCCCAGCGTCAACAGTGACCTCAAGTACATCAAGGC 
CAAGGTCCTTCGGGAAGGCCAGGCCCGTGAGAGGGACAAGTGCACCATCCAGTGAGCGAG 
GGATGCTGGGGCGGGGCTTGGCCAGTGCCTTCAGGGAGGTGGCCCCAGATGCCCACTGTG 
CGCATCTCCCCACCGAGGCCCCGGCAGCAGTCTTGTTCACAGACCTTAGGCACCAGACTG 
GAGGCCCCCGGGCGCTGGCCTCCGCACATTCGTCTGCCTTCTCACAGCTTTCCTGAGTCC 
[G,Cj 

CTTGTCCACAGCTCCTTGGTGGTTTCATCTCCTCTGTGGGAGGACACATCTCTGCAGCCT 
CAAGAGTTAGGCAGAGACTCAAGTTACACCTTCCTCTCCTGGGGTTGGAAGAAATGTTGA 
TGCCAGAGGGGTGAGGATTGCTGCGTCATATGGAGCCTCCTGGGACAAGCCTCAGGATGA 
AAAGGACACAGAAGGCCAGAT GAGAAAGGT CT CCT CT CT CCTGGCATAACACCCAGCTT G 
GTTTGGGTGGCAGCTGGGAGAACTTCTCTCCCAGCCCTGCAACTCTTACGCTCTGGTTCA 

84 37 AGCGTCAACAGTGACCTCAAGTACATCAAGGCCAAGGTCCTTCGGGAAGGCCAGGCCCGT 
GAGAGGGACAAGTGCACCATCCAGTGAGCGAGGGATGCTGGGGCGGGGCTTGGCCAGTGC 
CTTCAGGGAGGTGGCCCCAGATGCCCACTGTGCGCATCTCCCCACCGAGGCCCCGGCAGC 
AGTCTTGTTCACAGACCTTAGGCACCAGACTGGAGGCCCCCGGGCGCTGGCCTCCGCACA 
TTCGTCTGCCTTCTCACAGCTTTCCTGAGTCCGCTTGTCCACAGCTCCTTGGTGGTTTCA 
[T r G] 

CTCCTCTGTGGGAGGACACATCTCTGCAGCCTCAAGAGTTAGGCAGAGACTCAAGTTACA 
CCTTCCTCTCCTGGGGTTGGAAGAAATGTTGATGCCAGAGGGGTGAGGATTGCTGCGTCA 
TATGGAGCCTCCTGGGACAAGCCTCAGGATGAAAAGGACACAGAAGGCCAGATGAGAAAG 
GTCTCCTCTCTCCTGGCATAACACCCAGCTTGGTTTGGGTGGCAGCTGGGAGAACTTCTC 
TCCCAGCCCTGCAACTCTTACGCTCTGGTTCAGCTGCCTCTGCACCCCCTCCCACCCCCA 

857 9 GCCCACTGTGCGCATCTCCCCACCGAGGCCCCGGCAGCAGTCTTGTTCACAGACCTTAGG 
CACCAGACTGGAGGCCCCCGGGCGCTGGCCTCCGCACATTCGTCTGCCTTCTCACAGCTT 
TCCTGAGTCCGCTTGTCCACAGCTCCTTGGTGGTTTCATCTCCTCTGTGGGAGGACACAT 
CTCTGCAGCCTCAAGAGTTAGGCAGAGACTCAAGTTACACCTTCCTCTCCTGGGGTTGGA 
AGAAATGTTGATGCCAGAGGGGTGAGGATTGCTGCGTCATATGGAGCCTCCTGGGACAAG 
[T,C] 

CTCAGGATGAAAAGGACACAGAAGGCCAGATGAGAAAGGTCTCCTCTCTCCTGGCATAAC 
ACCCAGCTTGGTTTGGGTGGCAGCTGGGAGAACTTCTCTCCCAGCCCTGCAACTCTTACG 
CTCTGGTTCAGCTGCCTCTGCACCCCCTCCCACCCCCAGCACACACACAAGTTGGCCCCC 
AGCTGCGCCTGACATTGAGCCAGTGGACTCTGTGTCTGAAGGGGGCGTGGCCACACCTCC 
TAGACCACGCCCACCACTTAGACCACGCCCACCTCCTGACCGCGTTCCTCAGCCTCCTCT 

102 92 AGCCTTGGCCCCTGCCTGCGTCTGTGCTCAAAGCAGCAGCTCCAACCCTGCCTCTGTCCC 
CTTCCCCACCCACTGCCTGAGCCTTCTGAGCAGACCAGGTACCTTGGCTGCACCGGTGTG 
TGGCCCGCTCTCACCCAGGCACAGCCCCGCCACCATGGATCTCCGTGTACACTATCAATA 
AAAGTGGGTTTGTTACAAAGCCGTGTCCTTGCCCATGTGTATTTTTTGTATTTCCAAGAG 
GAGGTGTGCCCCTTTCCAGACCAAAGCTGGCCTTTCCCTCCCAAAATGCACCTGCCGTGT 
[A, G] 

CCCTGGCCCTGAGGGTCAGCACTGAGTCCACCTTCAAGTGTAAGTGTGGGGAGAGGGGGA 
TAAGTCCCCCAGATGGAAGGTGATGCCCTCCTTCAGCCTGGCCCTCCTGGGTCCTCCGGG 
TGTGTGTACCGAGGTGTCTGTGTCCACAAAGAAGGGGCCCCCGTGGACCATTAGCTCCAG 
GAGGATCTCCGTGTCTGAGTTCTTTGTGATTCCTGTACAGCAGCAATTTCACCCGCAGGG 
GACAGTTGGCAATCTCTGGAAACCTTTTCCAAGCCTGGGGCTGGGGCTGCTACTCTCATC 

107 92 TTCTTTGTGATTCCTGTACAGCAGCAATTTCACCCGCAGGGGACAGTTGGCAATCTCTGG 
AAACCTTTTCCAAGCCTGGGGCTGGGGCTGCTACTCTCATCTGGTGGGTGGAGGCCAGGG 
AC AC CAT T C AGT AT C CT C C AACG CACAG GAT G C CC C T C C AC C C C C AC C C C ACT GAGAAT T 
ATCTGGCCTCAAATGCCAAGCGTGGGCAGCCTTACTTAGACTCACCCCAGGGGCTGGGAC 
ACGCCCCCACCTGCGTGTGATGGATTTGTTGGACCACATTCTGGACGGAACCCACAGCAT 
[A,G] 

AGCACTCCTGTGAAGTGAGACAGGATGTGGGTGAGGATGGAAAGTGGAGGCTGAGGGAGA 
AGGTCTGGGCCCTGACCAACACGGAATGTGCCCCCTGGGACTGAGAGGCTTCCCTGGGCA 
GAGGGAAAGGAGGAAGTCAGTGAGGTAAAATACTCCCTGTGTGTTTTACCCAGCGAGTCT 
CACGCCATCCTATCACCCAGCCCCGAGGGAAGCCCACTCATGTTCACCCCATCTGAGCAT 
TTAGGCTCAGAGAGCTCAATATCTTGTCCAAGATGGCACAGCTGGTGAAGTGGCAGATCA 
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